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Eigenstate thermalization: Deutsch’s approach and 
beyond 
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Universitat Bielefeld, Fakultat fiir Pliysik, 33615 Bielefeld, Germany 

Abstract. The eigenstate thermalization hypothesis (ETH) postulates that the 
energy eigenstates of an isolated many-body system are thermal, i.e., each of them 
already yields practically the same expectation values as the microcanonical ensemble 
at the same energy. Here, we review, compare, and extend some recent approaches to 
corroborate this hypothesis and discuss the implications for the system’s equilibration 
and thermalization. 
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1. Introduction 

The relaxation of a macroscopic many-body system towards thermal equilibrium is a 
very common phenomenon, but has still not been satisfactorily understood theoretically 
[1, 2]. In particular, isolated systems and their text-book description at equilibrium by 
a microcanonical ensemble [3] have recently regained considerable attention [4-6]. An 
immediate first puzzle is the mere fact that the system apparently approaches a steady 
long-time limit though the quantum mechanical time evolution of a non-equilibrium 
initial state is well-known notio become asymptotically time-independent (see e.g. Sect. 
2 below). As one possible way out, one could, for instance, try to show that after a 
sufficiently long “equilibration time”, the expectation values of pertinent observables 
become “practically constant” (fluctuations remain below any reasonable resolution 
limit) for “practically all” later times (exceptions do exists - e.g. due to quantum revivals 
- but are exceedingly rare). Indeed, results of this type have been established under 
fairly weak and plausible assumptions about the initial state, the Hamiltonian, and the 
observables of the considered system [2, 7-12]. As an natural next step, quantitative 
estimates of the above mentioned equilibration times are currently attracting increasing 
interest [10, 13-16]. This is a very important but also quite difficult issue of its own 
right, which goes beyond the scope of our present paper. 

Here, we rather will focus on another natural next issue, named thermalization: 
Given that the expectation value of an observable equilibrates in the above sense, how 
well does this long-time limit agree with the corresponding microcanonical expectation 
value, as predicted by equilibrium statistical mechanics? A sufficient condition for 
a good such agreement is the so-called eigenstate thermalization hypothesis (ETH), 
essentially postulating that the expectation values of pertinent observables exhibit 
negligible variations for all energy eigenstates with sufficiently close energy eigenvalues. 
This hypothesis has its roots in closely related conjectures by Berry and Voros about the 
energy eigenstates of (fully) chaotic systems in the semiclassical limit, see e.g. Eq. (9) in 
[17] and Eq. (6.17) in [18]. Their implications for the (diagonal as well as off-diagonal) 
matrix elements in energy representation for observables with a well-behaved classical 
limit were further explored by Feingold and coauthors [19-21]. The key role of ETH for 
thermalization in high dimensional chaotic systems in the semiclassical regime was first 
recognised by Srednickij; [22-24]. Even earlier, its actual validity was numerically (and 
implicitly) exemplified and adopted as an explanation for the observed thermalization 
in a spin-chain model by Jensen and Shankar [25]. More recently, the seminal paper by 
Rigol, Dunjko, and Olshanii [26] introduced the term ETH, pinpointed its importance 
for thermalization, and stimulated numerous, predominantly numerical studies on the 

f It may be worth noting that in those semiclassical studies [17-24] the term “microcanonical ensemble” 
is used quite differently than in the present paper (see Sect. 3), namely referring to classical phase 
space averages over an infinitely thin energy surface. 
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validity of ETH for a large variety of specific models (mostly spin-chain- or Hubbard¬ 
like), initial conditions (often involving some quantum quench), and observables (mainly 
few-body or local), see e.g. [27-40]. 

Mathematically, the validity of ETH could be demonstrated so far only in special 
cases, namely for the eigenfunctions of the Laplace operator on an arbitrary dimensional 
compact Riemannian manifold whose geodesic flow is ergodic. If also the considered 
observables are sufficiently well-behaving, then ETH can be proven to hold for the vast 
majority of all eigenfunctions with asymptotically large eigenvalues [41-43]. 

Another analytical key work is due to Deutsch, implicitly verifying ETH for the 
vast majority of systems, whose Hamiltonians have been sampled according to a certain 
random matrix ensemble [44-46]. Here, we generalise this approach by Deutsch and 
unravel its close connection with other recent explorations of thermalization, especially 
by Goldstein and coworkers [5, 13, 16, 47-49]. 

2. Equilibration 

We consider a large (macroscopic but finite), isolated system, modelled in terms of a 
Hamiltonian H with eigenvalues En and eigenvectors |n), where n G N and En+i > E^- 
System states - either pure or mixed - are described by density operators pit), evolving 
according to p{t) = Utp{C)Ul with propagator Ut := exp{—iTTt} and h = 1. It follows 
that pmnit) := {m\p{t)\n) is given by pmn(O) exp[—i(i7m — En)t], i.e., unless the system 
was already in a steady state initially, it remains time-dependent forever. In other words, 
non-equilibrium initial states do not seem to “equilibrate” towards a steady long-time 
limit in an obvious way. 

Observables are represented by self-adjoint operators A with expectation values 
Tr{p(t)A}. In order to model real experimental measurements it is, however, not 
necessary to admit any arbitrary self-adjoint operator [50-57]. Rather, it is sufficient 
to focus on experimentally realistic observables in the following sense [7, 58]: Any 
observable A must represent an experimental device with a finite range of possible 
outcomes of a measurement, 

• Oimax Ojrain i ( 1 ) 

where amax and Omm are the largest and smallest eigenvalues of A. Moreover, this 
working range of the device must be limited to experimentally reasonable values 
compared to its resolution limit 5A. Indeed, real measurements usually yield at most 10- 
20 relevant digits, i.e. it is sufficient to consider range-to-resolution ratios Na/5A < 10^°. 

Next we dehne for any given M > 0 and T > 0 the quantity 

Tm := |{0 < t < T : I Tr{p(t)A} - Tr{p,,A}| > <i4}| , (2) 

where |{...}| denotes the size (Lebesgue measure) of the set {...} and where the time- 
independent, so-called equilibrium or diagonal ensemble p^q is defined as the diagonal 
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part of p(0), i.e. (peg)mn := ^mnPnn(O). As detailed e.g. in [11, 12], one then can show 
that 

^ < 2 m^{pnn{^)} (3) 

for all sufficiently large T. For the sake of simplicity, we also have taken here for 
granted that the energy gaps Em — En are hnite and mutually different for all pairs 
m ^ n. Generalisations have been worked out e.g. in [9-12]. 

According to (2), the left hand side of (3) represents the fraction of all times 
t G [0,T], for which there is an experimentally resolvable difference between the true 
expectation value Tr{p(t)A} and the time-independent equilibrium expectation value 
Tr{peqA}. On the right hand side, Aa/SA is the above mentioned range-to-resolution 
ratio and max„{p„„(0)} represents the largest occupation probability of an energy 
eigenstate (note that the pnnif) are conserved quantities). 

For a macroscopic A^-body system there are roughly energy eigenstates 

with eigenvalues in every interval of IJ beyond the ground state energy [3]. Since 
N = 0(10^^), the energy levels are thus unimaginably dense and even the most 
careful experimentalist will not be able to populate only a few of them with signihcant 
probabilities Pnn(O). In the generic case we thus expect [7, 58] that - even if the system’s 
energy is hxed up to an extremely small experimental uncertainty, and even if the energy 
levels are populated extremely unequally - the largest population Pnn(O) will still be 
extremely small (compared to Pnn(O) = 1), overwhelming by far the factor (A^/M)^ 
on right hand side of (3). 

Since the level populations Pnn(O) are the result of the system preparation, a 
more detailed understanding and quantihcation of those terms necessarily requires the 
modelling of such a preparation procedure. We come back to this point in Sect. 7, 
where arguments will be provided that 

max{p„„(0)} = (4) 

n 

can be expected in many cases. 

From (3) together with (4) we can conclude that the system generically equilibrates 
in the sense that it behaves in every possible experimental measurement exactly as if 
it were in the equilibrium state peq for the overwhelming majority of times within any 
sufficiently large time interval [0,T]. 

3. Thermalization 

Next we address the question whether, and to what extent, the above discussed 
equilibrium expectation value Tr{pegA} is in agreement with the corresponding 
microcanonical expectation value, as predicted by the textbooks on equilibrium 
statistical mechanics for our isolated A-body system at hand [3]. 

To begin with, Imic '■= [E—AE, E] denotes the usual microcanonical energy window 
about the (approximately known and thus preset) system energy E, whose width AE 
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is macroscopically small (below the experimental resolution limit) but microscopically 
large (much larger than the typical energy level spacing — En). The number of 

energy eigenvalues En contained in Jmic is denoted as D and is typically very large. The 
corresponding microcanonical ensemble is given by 


Pmic • — 



mic 


(5) 


where the sum Xlmic with En G /mic- In other words, = 1/D 

if En G /mic and Pnn = 0 otherwise. Hence, the expectation value of A in the 
microcanonical ensemble takes the form 

Tr{p.*A} = f/ZA„„ = (6) 

n mic 

On the other hand, recalling that {peq)mn ■= ^mnPnni^) implies 

Tl{PeqA} = ^Pnn(O) Ann ■ (7) 

n 

As usual, we henceforth assume that the system is experimentally prepared at the preset 
macroscopic energy E, i.e. also the Pnn(0)’s are negligibly small for energies En outside 
/mic- However, within I^ic the actual populations Prin(O) are still largely unknown and 
cannot be controlled by the experimentalist. In general we therefore have to admit the 
possibility that they considerably vary in a largely unknown (pseudo-random) fashion 
even between neighbouring n’s. 

The problem of thermalization thus amounts to showing that the difference between 
(6) and (7) is negligible in spite of the lack of knowledge about the Pnn(0)’s. 


4. Eigenstate thermalization hypothesis (ETH) 

As mentioned in the introduction, the ETH consists in the surmise that the expectation 
values Ann of an observable A hardly differ for eigenstates \n) of a many-body 
Hamiltonian H with sufficiently close energy eigenvalues En [22-24, 26, 44, 45]. In 
particular, if the variations of the A„„’s are negligible over the entire microcanonical 
energy window /mic, then the (approximate) equality of (6) and (7) follows immediately. 
In this sense, ETH is a sufficient (but not necessary) condition for thermalization. 

Similarly as for the microcanonical ensemble in (6), ETH also implies the 
equivalence of peq in (7) with a large variety of other pure or mixed steady states, 
whose level populations are mainly concentrated within the energy window /mic- On 
the one hand, this includes other equilibrium ensembles such as the canonical ensemble, 
provided the considered energy interval AE is large enough to accommodate all notably 
populated energy levels. (As we will see later, the latter requirement is in fact quite 
problematic.) On the other hand, even a single energy eigenstate \n) with En G /mic 
will do. In other words [26], such an energy eigenstate encapsulates all properties of the 
considered many-body system at thermal equilibrium! 
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Two rather delicate problems, which any “validation” of ETH has to resolve, are 
as follows: (i) For any given Hamiltonian H, one can readily construct (a posteriori) 
observables A which violate ETH, e.g. Ann = (“I)"" and arbitrary Amn for m ^ n. 
(In contrast to what Ref. [39] might suggest, an ETH-violating observable thus needs 
not be a conserved quantity.) In particular, this example implies that ETH cannot be 
satisfied simultaneously for all observables, and in fact not even for all experimentally 
realistic observables as specified below Eq. (1). (ii) While ETH claims that expectation 
values Ann are (practically) equal for sufficiently close energy eigenvalues En, generically 
there are - of course - notable differences Amm — Ann when Em — En is not small. But 
how can the observable A “feel” whether the two eigenstates |m) and \n) of Ef belong 
to similar energies or not, without any a priori knowledge about the Hamiltonian H ? 

At hrst glance, it thus might seem unavoidable to somehow restrict the set of 
admissible observables. Indeed, the early explorations of ETH [17-24] solely had in 
mind semiclassical (small h) systems, which are classically chaotic, in conjunction with 
observables, which are h-independent and derive from smooth classical phase space 
functions (see Sect. 1). In contrast, the more recent, predominantly numerical studies 
were mainly focused on spin-chain- and Hubbard-like models [27-40] (i.e. without an 
obvious classical limit), and on few-body or local observables. Yet another option would 
be to only admit macroscopic observables, see Sect. 8 below. In either case, it is still 
not obvious whether and why such a restricted class of observables may get around 
the above mentioned problems (i) and (ii). The solution of those problems within our 
present approach will be discussed in Sect. 9. 

5. The approach by Deutsch 

In this section, we reconsider the approach by Deutsch, originally published in [44]. 
For the detailed calculations, announced as Ref. [6] therein, see [45]. For an updated 
summary, see also [46]. 

5.1. Random matrix model 

Following Deutsch [44] we consider Hamiltonians H of the form 

H = Ho + V , ( 8 ) 

consisting of an “unperturbed” part Hq and a “perturbation” V. As before, eigenvectors 
and eigenvalues of H are denoted as \n) and En with En+i > En- Likewise, those of Hq 
are denoted as |n)o and E^ with > E^. 

Typical examples one has in mind [44] are Hq which describe a non-interacting 
many-body system, e.g. an ideal gas in a box, while V accounts for the particle-particle 
interactions. Further examples are so-called quantum quenches: Hq describes the system 
for times t < 0, while H applies to t > 0. In other words, some external condition or 
some system property suddenly changes at time t = 0. 
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In various such examples, the perturbation matrix 

VL-=o{'m\V\n)o (9) 

is often expected or numerically found to be a banded matrix [21, 44, 59, 60], i.e., the 
typical magnitude of decreases with increasing \m—n\ towards zero. Furthermore, in 
the above mentioned example where Hq describes a non-interacting many-body system, 
the perturbation matrix is usually very sparse, i.e., only a small fraction of all 
matrix elements is non-zero [60-62]. 

In any case, the perturbation V is required to be sufficiently weak so that the 
two systems H and Hq still exhibit similar thermodynamic properties at the considered 
system energy E, in particular similar densities of the energy levels, see above Eq. (4). 

As a next step, the common lore of random matrix theory is adopted [44, 47, 61]: 
One samples matrices from a certain random matrix ensemble with statistical 
properties which imitate reasonably well the main features of the “true” perturbation 
V (band structure, sparsity etc.), and it is assumed that if a certain property can be 
shown to apply to the overwhelming majority of such randomly sampled l/-matrices, 
then it will also apply to the actual (non-random) V in (8). A priori, such a random 
matrix approach may appear “unreasonable” since most of those randomly sampled 
perturbations V amount to systems which are physically very different from the one 
actually modelled in (8). Yet, in practice such a random matrix approach turned out 
to be surprisingly successful in a large variety of specihc examples [61], and hence, as in 
Deutsch’s work [44], will be tacitly taken for granted from now on. 


5.2. General framework 

The randomness of V entails via in (8) a randomisation of the eigenstates \n) of H and 
hence of the basis-transformation matrix 


Umn ■= {rn\n)Q . ( 10 ) 

Likewise, any given observable A and its matrix elements in the unperturbed basis 

: = o(m|A|n)o (11) 

are non-random quantities, while 

Amn ■= {rn\A\n) (12) 


will be the elements of a random matrix, inheriting the randomness of the 17-matrix via 


A 


mn 


E u.njA%u:,. 

jk 


(13) 


Demonstrating ETH thus amounts to showing that A^m — Ann is small for most 
l/’s and sufficiently close m and n. Formally, this will be achieved by considering the 
variances 


•= ((^nn “ {Ann)vY)v = ((^nn)^)v “ (^nn)y , 


( 14 ) 
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where indicates the average over the random perturbations V. In a first step (Sect. 
5.4), we will show that the mean values {Ann)v for sufficiently close n’s differ very little 
in comparison to the experimental resolution limit 5A introduced in Sect. 2. In a second 
step (Sect. 5.5), we will show that cr„ -C M, implying that Ann differs very little from 
{Ann)v for most V. Altogether, this will imply the desired result that for most Id’s the 
Ann’s change very little upon changing n (Sect. 5.6). 

To simplify the algebra, we henceforth assume that the largest and smallest 
eigenvalues of A in (1) satisfy 

O^min ^max • (f^) 

As a consequence, ttmax = Ayi/2 according to (1). Note that the assumption (15) does 
not imply any loss of generality, since adding an arbitrary constant (times the identity 
operator) to the observable A, and hence to all its eigenvalues, does not entail any non¬ 
trivial physical consequences. In particular, the above mentioned changes of {Ann)v 
upon variation of n and the variances (14) remain exactly the same. For later use, we 
thus can conclude that 

MA-^m<{AA/2r (16) 

for any G N. 

For the sake of simplicity, we furthermore assume that all matrix elements V^n from 
(9) and Umn from (10) are real numbers. For example, for systems without spins and 
magnetic helds, Hq and V in (8) are both purely real operators in position representation 
and hence the eigenstates |n)o and \n) can be chosen so that all V^n Umn become 
real. So, it is natural to assume that also the corresponding random matrix ensembles 
only involve real matrix elements. In particular, this implies with Eqs. (13) and (14) 
that 

{Ann)v= Y.A]^{Un,Unk)v (17) 

jk 

=Y, A^AUUmUn.UnkUnlW " {Ann)"v • (18) 

ijkl 

It may well be that our subsequent calculations can be readily extended to systems 
for which such a transformation to purely real matrices and Umn is no longer 
possible. However, the so far available knowledge, e.g. from random matrix theory 
or numerical investigations, regarding the statistical properties of the UmnS is only 
sufficient for our purposes for real matrices (see next Section). Since the subject of our 
paper is not the exploration of such statistical random matrix properties but rather their 
implications with respect to ETH, we confine ourselves to the case of real matrices. 

5.3. Properties of Umn 

In view of (17), (18), some basic statistical properties of the matrix elements Umn are 
needed in order to make any further progress. 
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At this point it is crucial to note that the Hamiltonian H in (8) gives rise to 
a very special type of random matrix. Namely, the matrix Q{rn\H\n)Q is the sum of 
the above discussed random perturbation and of the non-random diagonal matrix 
o(m l^ol ^)o = ^mnE^, whose diagonal elements grow approximately linearly with n 
(at least within a sufficiently small vicinity of the preset system energy E, onto which 
we tacitly restrict ourselves, see also Sects. 3 and 5.6). Out of the huge literature on 
random matrix theory, only a relatively small number of works pertains to this special 
case, see e.g. [46, 60, 62, 63] and further references therein. Strictly speaking, they are 
obtained for infinitely large matrices whose statistical properties do not depend on 
m and n separately, but only on the difference m — n. Likewise, the unperturbed matrix 
o(m|iLo|’^)o is assumed to be inhnitely large and of the form SmuE^ with equally spaced 
energy gaps — E^. Intuitively, these seem quite plausible approximations, at least 
for not too strong perturbations V in (8). They can be readily justihed by numerical 
examples, but somewhat more rigorous analytical results do not seem to exist. Here, we 
adopt the widely accepted viewpoint that, for out present purposes, they can be taken 
for granted [44, 60, 61]. 

As a consequence, also the statistical properties of the Umn only depend on m — vr, 
e.g. the i^-th moments are of the form 

{{Umny)v = u^m - n) , (19) 

where the Uy{n) are real (but not necessarily even [63]) functions of n, and are 
furthermore non-negative for even v. 

Known analytical results mainly concern the second moment U 2 {n) for various 
ensembles of possibly banded and/or sparse random H-matrices, see e.g. [46, 60, 62, 63] 
and references therein. In all cases, it is found that U 2 {n) is monotonically decreasing 
for n > 0 and monotonically increasing for n < 0, hence exhibiting a global maximum 
at n = 0: 


max{M 2 (n)} = ^2(0) . 

n 

Since Ylk UmkUnk = ^mn we can conclude that 

= 1. 


( 20 ) 

( 21 ) 


implying that U 2 {n) must approach zero for large \n\. 

In all those analytical results, the mean values are tacitly assumed to vanish 

and it is found that U 2 {n) then only depends on the second moments ((V][]„)^)y. Since we 
are not aware of any justihcation for this assumption = 0, we have numerically 

investigated various examples and found that, indeed, the statistical properties of the 
f/mn’s seem to be independent of the first moments (while keeping all other 

cumulants hxed). Furthermore, a simple physical argument is as follows: Replacing 
an eigenstate |n)o of Hq by — |ri)o is supposed not to change any physically relevant 
properties of the given (non-random) model in (8). Note that this argument applies 
separately to any single n. Hence, it is quite plausible that upon randomly flipping 
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the signs for half of all n’s, the resulting “new” will be “unbiased” for m ^ n. 

(More precisely: if a random matrix description works at all, then an ensemble with 
{VD V = 0 seems most appropriate). Finally, a possibly remaining systematic “bias” 
of the diagonal elements can be removed by adding an irrelevant constant to V. 
(The typical magnitude of the ^Iso estimated in Appendix B of [45], however, 

not taking into account the possibility that their average may be zero.) 

For the rest, the detailed properties of U 2 {n) are found - as expected - to still 
depend on the quantitative details of {(y^^Y)v- Since no general statement about the 
latter seems possible for the general class of systems we have in mind with (8), we will 
focus on conclusions which do not depend on the corresponding details of U 2 {n). Rather, 
we will only exploit the following very crude common denominator of all so far explored 
particular classes of random matrices Vmn, see e.g. [46, 60, 62, 63] and further references 
therein: 

M2(0) = . (22) 

The basic physical reason is that exceedingly “weak” perturbations V in (8) are tacitly 
ignored so that the smallest relevant energy scale is the mean level spacing E^+i — En, 
being of the order of J according to Sect. 2. Moreover, the ratio between this 

energy scale and any other relevant energy scale of the system can be very roughly 
estimated by independently of any further details of the specihc model system 

in (8). As a consequence, also the very crude estimate (22) is independent of these 
details. 

Further statistical properties of the Umn, which we will, similarly as in [44, 45, 60, 
62, 63], take for granted later on, are: 

(i) Their average is zero, i.e., 

ui{n) = 0 for all n. (23) 

(ii) They are statistically independent of each other, i.e., 

Umn is independent of Ujk ii m ^ j or n ^ k. (24) 

(iii) Their distribution does not exhibit long tails, i.e., 

u^in) < cu 2 {n) U2{0) for all n (25) 

with an n-independent constant c, which may possibly be very large but which is required 
not to be so large that it can compete in order of magnitude with 1 /m 2 ( 0 ) from (22). 
For instance for a system with N = 10^^ particles, it would be sufficient that c < 10^°^^. 
In other words, we adopt the very weak assumption 

cn2(0) = . (26) 

E.g., for a Gaussian distribution (with zero mean, cf. (23)) one hnds that u^i^n) = 3ul{n) 
and hence (25) is satisfied for c = 3. Though a Gaussian distribution is often taken for 
granted [22, 44, 45], non-Gaussian distributions have been actually observed e.g. in 
[59] and also in our own numerical explorations (unpublished), but the more general 
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condition (25) was still satisfied in all cases. We also note that since U 2 {n)/u 2 {C) 
approaches zero for large |?7,|, the condition in (25) becomes weaker and weaker with 
increasing \n\. 


5.4- Mean values 


Evaluating (17) by means of (19), (23), and (24) yields 

{Ann)v = '^^^%{UnjUnj)v + ^%{Unj) V {Unk) V 


j¥=k 


J2K^2in-j) 


It follows that 


(27) 


)y {-^nn^V 

= 5Z^?iN(rJ +1 - j)-M2(n-j)] (28) 

3 

and hence that 

|A„| < '^\AC\\u2{n + I - j) - U2{n - j)\ 

3 

< max \A^jj\ ^ \u 2 {k + 1) - ^2(^)1 • (29) 

^ k 

The maximum over j can be estimated from above by Aa/2 according to (16). Recalling 
that U 2 {k) is monotonically decreasing for k > 0 and monotonically increasing for k < 0 
(see above (20)), the sum over k amounts to 2 ^ 2 ( 0 ) (more generally, this sum amounts to 
the total variation of U 2 {k)] hence, if U 2 {k) exhibits M local maxima, it can be estimated 
from above by 2M maxfcM 2 (fc)). Altogether, we thus can conclude that 

I {An+l,n+l)v — {Ann)v\ < AaU2{0) . (30) 

This upper bound is tight: One can readily hnd examples A for which (30) becomes an 
equality. Moreover, it follows that 

I {Airirn} V {Ann) V \ — ^ I Aa U 2 (0) . (31) 

By generalising the line of reasoning in (29), (30) one can also show that 

K, 

\{Amm)v — {2^nn)v\ < ^ U 2 {k) (32) 

k=—K 

with K := \m — n\ — 1. Under certain conditions, the bound (31) may be better than 
(32), but never by more than a factor of 2. For sufficiently large \m — n\, (32) is always 
better since the sum on the right hand side is bounded by unity (see (21)), while the 
right hand side of (31) is unbounded. (The relevance of large |m —?7,|-values will become 
apparent in Sect. 5.6 below). In any case, (32) is a rather tight bound in the sense that 
one can hnd examples for AT so that the left hand side is larger than the right hand 
side divided by 2 for a set of suitably chosen pairs (m, n) so that the differences m — n 
may still take any integer value. 
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5.5. Variances 

We rewrite the variance from (18) as 




+ {Ann)l = Y, AlAli{Ur,.UnjUnkUni)v (33) 

ijkl 

and evaluate the four-fold sum by distinguishing 4 possible cases. Case 1\ i ^ k and 
i = j. In this case, we only have to keep summands with I = k: otherwise the factor 
Unk on the right hand side of (33) would be independent of the remaining three factors 
according to (24), and the corresponding summand would vanish according to (19) and 
(23). Case 2\ i ^ k and i ^ j. As before, we can conclude that only summands with 
I = i and j = k give rise to non-vanishing terms. Case 3: i = k and i ^ I, implying, 
as before, that only j = I contribute. Case 4: i = k and i = I, implying j = i. 
Consequently, we can rewrite (33) as 

+ {^nn)v = -|- S 2 -|- S 3 -|- S 4 , (34) 

where the four summands correspond to the above four cases and can be rewritten as: 






S2= Y^''^kAl{UmUnkUnkUm)v 


i^k 


S3= 






S4 = Y^l^l^UnrUmUniUm)v • 
i 

With the help of (24) and (19) we can rewrite (35) as 
Si = ^ AlAl^U2{n - i)u2{n - k) 

i^k 

= Y - i)u2{n - k) - ^(A 

ik i 

= {Ann)l - - i) , 

i 

where we exploited (27) in the last equation. Likewise, one hnds that 

T.2=Y- *)“2(n -k)- - i) 

ik i 

^3 = 5^(^°z)^M2(n - i)u2{n - 1) - 0 

il i 

S 4 = - i) . 


( 35 ) 

( 36 ) 

( 37 ) 

( 38 ) 


( 39 ) 

( 40 ) 

( 41 ) 

( 42 ) 


Introducing these results into (34) thus yields 

^‘n— Si + S 2 + S^ 


( 43 ) 
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-^1 := \^^ik?U2{n - i)u2{n - k) (44) 

ik 

S 2 := - i)u2{n - 1) (45) 

ii 

^3 ■= -i)- ^ul{n - z)] . (46) 

i 

The three factors on the right hand side of (44) are all non-negative and hence 

= l^il < max{u2{k)}^U2{n - . (47) 

i k 

With (11) one readily sees that the last sum over k amounts to o{i\A?\i)Q =: (A^)°. 
Exploiting (20) we thus obtain 

-^1 < ^ 2 ( 0 ) ^(71^)° U2{n - i) . (48) 

i 

Likewise, since U 2 {i) > 0 for all i, the modulus of (45) can be estimated as 

|5'2| < ^ \A%\^U2{n - i)u2{n - 1) = Si . (49) 

il 

Turning to (46), we hrst note that the last factor ^4 := M 4 (n — i) — Su^^n — i) 


represents the 4th cumulant of the random variable Uni- For a Gaussian distribution 
(the case considered by Deutsch [44, 45]), this cumulant vanishes, but for more general 
distributions it may be hnite and of either sign. We thus estimate 1^41 from above by 
M4(?7, —z)-|-3 m 2(’^ —*)• Observing (26) andzz 2 (n —z) < U 2 {Gi)u 2 {n — i) (see (20)), we thus 
can bound (46) by 

1 *^ 3 ! < M2(0)(c + 3) ^(71°) V(n - z) . (50) 

i 

Next, we invoke the Cauchy-Schwarz inequality to conclude 

(51) 

for arbitrary Hermitian operators B and vectors |^). In particular, it follows that 
{Alf < (712)0.. Altogether, we thus arrive at 

< (c + 5) zz2(0) 5^(A2)0 zz 2 (n - z) . (52) 

i 

Finally, we exploit (16) and (21), resulting in 

^l<{c + 5) zz2(0) (A^/2)2 = Al , (53) 

where we used (26) in the last step. 

5.6. Discussion 

From (14), (53), and Markov’s inequality it follows that 

Prob (|4„„ - (Ann)v\ >i)< (A^/e)^ iQ-oi") 


( 54 ) 
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for any e > 0, where Prob(X) denotes the probability that a randomly sampled 
V in (8) entails property X. For instance, if in the last term 0{N) = 10^^ and 
e = Xa then the right hand side of (54) is still Consequently, the joint 

probability that every Ann is practically indistinguishable from {Ann)v simultaneously 
for all n G {no,no + An} still remains negligibly small if 0 < An < 

On the other hand, (22) and (31) imply that the difference {Amm)v ~ {Ann)v 
remains below the experimental resolution limit 6A of A (cf. Sect. 2) even for quite 
large range-to-resolution ratios Aa/SA, provided |m — n| remains much smaller than of 
the order of 10°^^^ In other words, also the variations of the {Ann)v remain negligibly 
small within the “window” of n-values {no, ..,no + An} if 0 < An < 

Altogether, we thus arrive at the conclusion that for the vast majority of randomly 
sampled perturbations V in (8), the A„„’s remain practically constant (below the 
experimental resolution limit) as long as n varies by much less than 10*^^^^ 

The latter property is sometimes referred to as the strong ETH [33, 38]. It 
immediately implies the practical indistinguishability of the two expectation values (6) 
and (7), and hence thermalization, provided both the Pnn(O) and the are negligibly 
small outside a window of n-values much smaller than but otherwise without 

any further restriction on the initial condition p(0). 

If the range An of admitted n-values is not any more much smaller than 
then we can no longer conclude from (53) that with high probability all the A„„’s 
remain simultaneously close to the (A„„)y’s. However, we still can conclude that for 
the vast majority of n’s, those differences remain negligibly small§. If, in addition, also 
the variations of the (A„„)y’s would remain small, we still could conclude that “most” 
Ann’s are practically equal (for the overwhelming majority of H’s), i.e. the so-called weak 
ETH is satished [33, 38]. As a consequence, the expectation values (6) and (7) would 
again be practically equal under certain additional conditions on the initial condition 
p(0). For instance, the total weight of all Pnn(0)'s corresponding to exceptionally large 
differences Ann — (^nn}v should remain sufficiently small. E.g. (4) would obviously be 
a sufficient condition. 

However, this line of reasoning contains a problem; If the variations An of n are 
not any more much smaller than then Eqs. (31) and (22) no longer imply that 

the variations of {Ann)v remain negligible. The same conclusion follows from the bound 
(32). Since the latter bound is already rather tight (see below (32)), we can conclude 
that the restriction to windows of n-values much smaller than is not merely a 

technical problem but rather an indispensable prerequisite of the random matrix model 
from Sect. 5.1. In particular, this restriction also concerns the original hndings by 
Deutsch [44]. 

§ With Xn := \Ann — {Ann)v \ and 5 := (A^/e)^ Eq. (54) can be rewritten as (0(x„ —e))v < <5, 

where 0(x) is the Heaviside step function. Furthermore, := J2n=i ~ 0 counts how many of the 

a;„’s exceed e. It follows that {Z^)v < DS and with Markov’s inequality that Prob(Ze > qD) < S/q, 
where Prob(Ze > qD) is the probability that more than a fraction q of all x„’s exceed e. 
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Note that An from above is identical to the number D of energy eigenvalues 
contained in the microcanonical energy window /mic := [E — AE, E] from Sect. 3. The 
above discussed restriction thus amounts to 

D < (55) 

and implies that AE must remain very much smaller than any macroscopically 
resolvable energy difference. (This follows from the fact that the energy eigenstates 
are exponentially dense in the system size N, see Sect. 2). 

6. Srednicki’s ETH for the off-diagonal matrix elements 

In Refs. [23, 24], Srednicki formulated, besides the so far considered ETH for the 
diagonal matrix elements also a corresponding ETH for the off-diagonal elements 
^mn with m ^ n. This hypothesis can also be readily confirmed within our present 
framework: 

Similarly as in (17), we find from (13) that 

(^mn)y = A%{UmjUnk)v ■ (56) 

jk 

For m n, the last factor {UmjUnk)v equals {Umj)v{Unk)v according to (24), and hence 
vanishes according to (19) and (23). In conclusion 

{A^n)v = 0 (57) 

for all m ^ n. 

Turning to the second moment (variance), one finds similarly as in (14), (18) that 
{\Amn\‘^)v = {AmnAnm)v = A^jAli{UmiUnjUnkUml)v ■ (58) 

ijkl 

According to (24), for m 7 ^ n the last term {UmiUnjUnkUmi)v now factorises into 
{UmiUmi)v{UnjUnk)v- Exploiting (24) once more implies {UmiUmi)v = and 

{UnjUnk)v = ^jk{U^k)v- With (19) we thus obtain 

{\Amn\‘^)v = ^ \A^ii,\'^U2{m - i)u 2 {n - k) . (59) 

ik 

Step by step as in (44), (47), (48), (53) it follows that 

{\Amn\^)v<U2iO){AA/2f (60) 

for all m ^ n. In view of (22) we see that the off-diagonals \Amn\ are typically 
exponentially small in the system size iV, in agreement with Srednicki’s prediction in 
Refs. [23, 24]. 

The overall conclusion applying to any given Hermitian operator A of finite range 
Aa is: The representation of A in the eigenbasis of H is, for the overwhelming majority 
of randomly sampled perturbations V in ( 8 ), very close to a diagonal matrix, whose 
diagonal elements Ann change very slowly with n. 
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7. Implications for the level populations p„ri(0) 

Throughout this section, we consider the density operator p(0) (pure or mixed state) 
from Sect. 2 and abbreviate it as p. 

Since p is a Hermitian operator, all results so far for general observables A are 
immediately applicable to p. However, this particular observable A = p also exhibits 
some subtle special features. Therefore, we hrst focus on a simple example. 

7.1. Simple example 

We consider a pure energy eigenstate of the unperturbed Hamiltonian Hq in (8), i.e. 

P = \m)oo{ni\ (61) 

with an arbitrary but hxed m. Its eigenvalues are either zero or one, hence the range 
from (1) is Ap = 1. Observing that p°^ = Sim^km h follows from (13) and (17)-(19) that 

Pnn = \Umn\^ (62) 

{Pnn)v = U 2 {n - m) (63) 

= U 4 {n — m) — ul{n — m) , (64) 

where the variance from (14) is given for H = p by 

:= iiPnnfh - {Pnn)l • (65) 

As a concrete example, we may focus on Gaussian distributed UmCs (see below Eq. 
(26)), so that M 4 (n) = 3M|(n) and hence 

= 2 u 2 {n — m) . (66) 

Altogether, the standard deviation cr„ of the random variable pnn from (62) is 
thus comparable to its mean value {pnn}v, and both are, according to (20) and (22), 
extremely small compared to the range Ap = 1 of the considered observable p. Within 
any reasonable resolution limit 6p of this observable we thus can conclude that, for the 
vast majority of random perturbations V in (8), all pnn’s are practically equal (namely 
zero), in agreement with the general validation of ETH from Sect. 5. But for our 
present purposes, this usual resolution limit 6p is still way too large. On the actual 
scale of interest, the Pnn’s from (62) are not at all a slowly varying function of n, but 
rather exhibit very signihcant random fluctuations (see also (24)). In particular, it would 
be wrong to argue that the Pnn(0)’s in (7) are now practically constant and hence, upon 
comparison with (6), thermalization follows. 

7.2. General case 

We return to general density operators p, i.e., we only assume that p is a Hermitian, 
non-negative operator of unit trace and purity 

Ti'{/3^} < 1 . 


(67) 
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In the following, we will exploit these properties of p, which, however, wonld be lost 
after adding a constant to p so that (15) is satished. Hence we only can employ those 
previous results which were obtained without the help of (15). Along these lines, one 


hnds exactly as in (27) and (52) that 

{pnn)v = p'jjMn - j) (68) 

j 

(tI < (c + 5)m2(0)^(p^)° M2 (rj-i) , (69) 

i 

where is dehned in (65) and (p^)° := o(*|p^|*)o- Likewise, (57) and (59) yield 

{Pmn)v = 0 (70) 

{\pmn?)v = Y ldifcP“2("i - i)u 2 {n - k) (71) 

ik 

for all m ^ n. 

Introducing (20) into (68) implies 
{pnn)v < max{M 2 (j)} 

j 

where we exploited that the sum over j equals Trp = 1. Likewise, (69) yields 

a2<(c + 5)n2(0)Tr{pn. (73) 

Rewriting the dehnition from (28) as 

■ (Pn+l,n+l)y {Pnn)v 


we hnd along similar lines of reasoning that 

|An| < ^p°j|M2(n +1 - j)-M2(n-j)| 

j 

< max |m 2(’2 + 1) — M 2 (’ 2 )| . (75) 

n 

Moreover, we can conclude that 

Y\^^\ <5Zp?i5^|M2(ri + l-i)-'«2(n-i)| . (76) 

n J* 77 

Similarly as in (30), the last sum is seen to be equal to 2 ^ 2 ( 0 ). The remaining sum over 
j equals Tr p = 1 and hence 

<2^2(0) . (77) 

n 

Eqs. (72), (73) indicate that, in contrast to pure states in (61), for mixed states 
of small purity Tr{p^}, the random fluctuations of the Pnn’s about their mean values 
may become negligible. Moreover, the right hand side of (75) usually turns out 
[44, 45, 60, 62, 63] to be of the order of m^O). The same conclusion is also suggested 
by (77). Consequently, also the variations of {pnn)v as a function of n become small. 
Unlike for pure states we thus can now conclude that (6) and (7) are approximately 
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equal, implying thermalization. However, assuming a small purity of p represents a 
quite strong restriction in the first place. 

Returning to general p, we can deduce from (20) and (68) that 

'^{Pnn)v < X] max{M2(j)} PkkM^ - k) 

n n j k 

< M2(0) p% Y Pkk ■ (78) 

j k n 

The sum over n equals one according to (21) and the two remaining sums are equal to 
Tr p = 1, i.e., 

Y^P--)y ^ “2(0). (79) 

n 

Likewise, Eqs. (67) and (69) imply 

Y^n< (c + 5)u2(0)^(p^)° ^M2(n-i) 

i n 

= (c + 5) M2(0) Tr{p2} < (c + 5) ^ 2 ( 0 ) . (80) 

Altogether, this yields 

X^((Pnn)^)v < (c + 6) ^ 2 ( 0 ) (81) 

n 

and with (25) 

{Y^Pnnf^y ^ • (82) 

n 

Since (max„p„„)2 = max„(p„„)^ < Y^^PunY it follows that 

((maxp„„)^)y < 10"°(^i (83) 

n 

and hence by Markovs’s inequality that 

Prob ^max{p„„} > ej < e~^ , (84) 

see also the explanations below (54). 


7.3. Off-diagonals 


We focus on the off-diagonal matrix elements pmn with m ^ n. Their mean values are 
zero according to (70). Their variance can be readily bounded by the Cauchy-Schwarz 
inequality 


\Pr 


< P '. 


mm Pnn 


(85) 


Likewise, introducing \p^Yf < p^jP^^ into (71) yields with (68) the estimate 
(I Pmn I ')v C: {pmm)v {Pnn)v ■ 


( 86 ) 


Similarly as below (47) one sees that Ymn IPmnP = Tr{p^} and hence 
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According to (82), the last inequality in (87) is in fact a very tight upper bound. The 
same estimate follows by summing in (71) over m and n. Neither of these results indicate 
that the off-diagonal matrix elements are typically much smaller than the diagonal 
elements. We thus conjecture that typical off-diagonal matrix elements will in fact not 
be small compared to the diagonal elements. Trivial exceptions are pure states (61). 
Non-trivial exceptions may be mixed states of low purity, similarly as below (77). 

l.J^. Discussion 

The main result of this section is (84): It implies that for the overwhelming majority 
of randomly sampled perturbations V in (8) the last term in (3) is unimaginably small 
(essentially in agreement with (4)). In other words, equilibration in the sense of Sect. 
2 is verihed. We emphasise that all these conclusions do not depend an any further 
details of the actual initial condition p(0), except that it is assumed to be hxed, i.e. 
independent of the randomly sampled V. 

The physical interpretation is as follows: One specihc, usually not very well known, 
but nevertheless well-dehned initial state p(0) (pure or mixed) is “given” to us, and 
then evolves further in time according to one particular, randomly picked system 
Hamiltonian (8). Our results guarantee that the vast majority of those randomly 
sampled Hamiltonians gives rise to equilibration in the sense of Eq. (3). As the only 
unproven part remains the assumption that the actual (in detail not exactly known) 
system does not correspond to one of the rare, untypical Hamiltonians of the considered 
random ensemble. 

Since Tr{p} = 'YlnP^n = 1 we can conclude from (84) that, for most H’s, the 
number of non-negligible Pnn’s cannot he much smaller than As a consequence, 

the strong ETH scenario from Sect. 5.6 does not apply. In turn, to apply this scenario, 
a different physical set up is required, with a different physical view of how the initial 
condition p arises. Namely, one particular, but “typical” V in (8) is considered to have 
been randomly sampled but now is held hxed. Since we assumed the system is typical, 
strong ETH as specihed in Sect. 5.6 can and will be taken for granted. In a next 
step, the initial state p = p(0) for this particular system is specihed, arising, e.g., as 
the result of an experimental preparation procedure for this very system (with respect 
to other systems H of the ensemble, this preparation procedure may not be physically 
meaningful or not even well dehned). Finally, there must be good reasons (e.g. a very 
careful experimentalist) to assume that this preparation process yields level populations 
pnn which are negligible outside a window of n-values much smaller than see Eq. 

(55). 

In conclusion, our present formalism is able to validate either equilibration or 
thermalization, but not both of them simultaneously for one and the same physical 
model system. 
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8. Comparison with the approach by Goldstein and coworkers 

In a series of works [5, 13, 16, 47, 48] Goldstein and coauthors addressed the problem 
of thermalization by means of an approach which, at hrst glance, seems to be entirely 
different from our present one. In particular, the “intermediate” problem of equilibration 
(see Sect. 2) apparently can be entirely circumvented. One key point of their approach 
is the restriction to so-called macroscopic observables, i.e. observables whose statistical 
fluctuations are negligibly small for macroscopic systems at thermal equilibrium. In 
this section, we will show that the latter dehning property of a macroscopic observable 
implies that it satishes (weak) ETH. In other words, the restriction to such observables 
is essentially equivalent to assuming ETH. 

We dehne the microcanonical mean and variance of any given observable A as 

A := Trlpmic^} (88) 

AH2:= Ai{p^,,{A-Af} , (89) 

where pmic is the microcanonical ensemble from (5). By dehnition, an observable A is 
called a macroscopic observable, if its fluctuations AH are negligibly small. A more 
precise formal requirement would be vanishing fluctuations in the thermodynamic limit. 
A more appropriate real world (experimentally useful) version would be to require 
that the fluctuations are smaller than the experimental resolution limit M, with, e.g. 
5A = 10“^°A^, where A^ is the measurement range of the experimental instrument 
modelled by A, see Sect. 2. 

The above requirement represents a minimal condition: Whatever alternative 
dehnition of a macroscopic observable may be proposed, if it admits non-small 
huctuations in the microcanonical ensemble then it would not seem well-dehned to us. 
Indeed, the dehnitions employed in [5, 13, 16] are quite similar but not exactly identical 
to ours. We also note that the microcanonical ensemble itself is only used here as a 
formal device to dehne the notion of a macroscopic observable. It does not in any way 
anticipate that the actual system of interest should exhibit thermalization. 

Introducing (5) into (88) and (89) implies 


A — ^ ^ Ann 

mic 

(90) 


(91) 


mic 


where (H^)„„ := {n\A?\n). In other words, (90) represents the average over those H„„’s, 
whose energies En are contained in the microcanonical energy window. Their typical 
deviation from this average is quantihed by the variance 

4 = ^ - (^1)" ■ (92) 

mic mic 

From (51) we can conclude that (H„„)^ < (H^)n„ and hence that AAeth < AA. 
Assuming that A is a macroscopic observable thus implies that AAeth is small [33]. 
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According to the definition in (92) it follows that most A„„’s must be close to A, i.e. 
weak ETH is satisfied (see Sect. 5.6). 

Closely related considerations are originally due to [33], however focusing on so- 
called intensive local few-body operators rather than on macroscopic observables. 

Restricting oneself to macroscopic observables clearly has a long and well founded 
tradition, especially with respect to the thermodynamic roots of statistical physics. 
On the other hand, statistical physics itself is by no means restricted to such 
observables. Rather, it is understood and experimentally (and numerically) seen 
that also “microscopic” observables are perfectly well described by this theory. Such 
observables exhibit non-negligible fluctuations about their mean values and as such 
are hardly encountered in our everyday macroscopic world (exceptions may arise near 
critical points). But already with the help of an optical microscope, interesting 
observables exhibiting non-negligible thermal fluctuations (e.g. Brownian motion) 
become accessible. Even more so, within the rapidly developing fields of nanotechnology 
and single molecule experiments, such microscopic observables become of increasing 
practical relevance. 

We finally remark that Goldstein and coworkers conceive their own approach 
[5, 13, 16, 47, 48] as a continuation of von Neumann’s ground-breaking work [49]. 
Other relations between such an approach and ETH than the one discussed above 
have also been pointed out in Refs. [36, 48]. Furthermore, the concept of randomised 
Hamiltonians (or random matrices) also plays a key role in the approach by von 
Neumann and by Goldstein and coworkers [47]. However, rather than introducing this 
randomness directly into the Hamiltonian H itself - as done in the approach by Deutsch 
via Eq. (8) - the randomness is now introduced by prescribing the statistical properties 
of the randomly sampled eigenbases In) of H, see Sect. 5.2, while keeping the spectrum 
of H fixed. In spite of these technical differences, the two approaches are thus in fact 
very close in spirit (see, e.g. Sect. 6 in [47]). 

9. Summary and conclusions 

In a first step, we reconsidered the random matrix model of Deutsch [44-46] and worked 
out a more detailed and slightly more general demonstration that it validates ETH; For 
the overwhelming majority of the corresponding random ensemble of Hamiltonians 77, 
any given observable A is represented in the eigenbasis of 77 by an almost diagonal 
matrix with very slowly varying diagonal elements. More precisely: Apart from a 
fraction of exceptional 77’s which is exponentially small in the system size iV, the off- 
diagonal matrix elements Amn are exponentially small in N and the changes of the 
diagonal elements Ann as a function of n are also exponentially small in N. This implies 
the following solution of problem (i) from Sect. 4: For any given 77, one can readily 
construct (a posteriori) an ETH-violating A (see Sect. 4), but any such A still continues 
to satisfy ETH for most other 77’s. In turn, if 77 is not known in all details with 
extremely high precision, then a given observable is exceedingly likely to exhibit ETH. 
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The generalisation for more than one observable is straightforward: Given every 
single observable is exponentially unlikely to violate ETH, it is still extremely likely that 
all of them will simultaneously exhibit ETH, as long as their number is not exponentially 
large, i.e. remains within the limits of what is feasible in any real (or numerical) 
experiment. 

In a second step, we have shown by means of a further generalisation of Deutsch’s 
approach that also an essential prerequisite for equilibration, namely Eq. (4), will be 
satished for the overwhelming majority of Hamiltonians H. In doing so, an arbitrary 
(pure or mixed) state p(0) is admitted as initial condition. But this initial state p(0) 
must then remain always the same for the entire ensemble of random Hamiltonians H. 

We also identihed a not yet satisfactorily solved aspect of Deutsch’s original 
approach and our present generalisation: On the one hand, the changes of the diagonal 
matrix elements Ann as a function of n are exponentially small in the system size 
N up to exponentially rare exceptions. On the other hand, the typical difference 
between neighbouring energy levels En is also exponentially small in N (cf. Sect. 2), 
i.e. the number of energy eigenvalues contained in an energy intervals [E — AE, E] 
is exponentially large in N for the usual AE’s of interest. Hence, the variations of 
Ann within the entire energy interval may no longer be negligible. As a consequence, 
thermalization, i.e., the practical indistinguishability of (6) and (7), can only be proven 
under the extra condition that the interval of relevant n-values, which notably contribute 
to those sums in (6) and (7), is not too large, namely much smaller than (cf. 

(55)). In other words, only exceedingly small AE’s are admitted. In the following four 
paragraphs, we conclude with four noteworthy remarks and implications. 

In spite of this restriction, the admitted range of n-values is still huge, and likewise 
for the admitted energy intervals AE in comparison with the energy level spacings. In 
particular, they are still of physical interest: For instance, one may imagine that the 
experimentalist has prepared the system with a sufficiently small uncertainty in the total 
system energy E so that the corresponding condition can be safely taken for granted for 
the pnn’s appearing in (6) and (7). 

As mentioned already in in Sect. 4, our Endings imply the equivalence of peq in (7) 
not only with the microcanonical ensemble p^ic in (6) but also with any other equilibrium 
(i.e. steady state) ensemble, provided that its level populations are mainly concentrated 
within a sufficiently small energy window as specified above. Unfortunately, this 
condition is not satisfied e.g. for the canonical ensemble. 

The other, more fortunate side of the coin is that within our present approach 
the diagonal matrix elements Ann are indeed not forbidden to exhibit non-negligible 
variations for sufficiently large changes of n, or, equivalently, for macroscopically notable 
changes of En- This solves problem (ii) from Sect. 4. 

Assuming one and the same initial state p(0) for the entire ensemble of random 
Hamiltonians H, as done in our above discussion of equilibration, implies that the 
number of pnn’s which notably contribute in (7) is not much smaller than for 

most H's. In conclusion, our present generalisation of Deutsch’s approach allows us to 
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corroborate either equilibration or thermalization, but not both of them simultaneously 
for one and the same physical model system. The root of the problem is as before: 
Whether and why the dependence of the diagonal matrix elements Ann on n is neither 
too strong nor too weak is not yet fully satisfactorily understood. A solution of this 
problem is presently being worked out. 
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